Polygonized Silhouettes and Polygon Coding Based Feature Representation for Human Action Recognition

نویسندگان

چکیده

The characteristics of human silhouette shape can be used for action recognition and classification. In this paper, a novel feature extraction method the silhouette-based classification actions in videos is proposed. proposed based on polygonization images coding. Since conventional generation methods do not satisfy integrity silhouettes, Yolact++ modified as generator. Our innovative approach masks are silhouettes to overcome problem. For purpose, new image form called Poly Silhouette (PoS), Polygonization (PoG) algorithm Polygon Coding (PoC) have been developed. step on, but similar curve polygonization. It fast, adaptable, accurate contour coordinates PoS images. PoCs were generated by projecting each edge vector from corner onto angular areas codes formed. These grouped into k-mers genetic algorithms features. guarantees that vectors equal length any video. Thus, no additional required dimensionality By using different k-mer lengths, accuracy versus computation time was analyzed depicted figures. developed tested HMDB51 & UCF101 datasets: SVM 20.98%, 1.63% k-NN 4.96%, 6.83%, respectively, significant improvements achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature extraction and representation for human action recognition

Human action recognition, as one of the most important topics in computer vision, has been extensively researched during the last decades; however, it is still regarded as a challenging task especially in realistic scenarios. The difficulties mainly result from the huge intra-class variation, background clutter, occlusions, illumination changes and noise. In this thesis, we aim to enhance human...

متن کامل

A polygon soup representation for multiview coding

This paper presents a polygon soup representation for multiview data. Starting from a sequence of multi-view video plus depth (MVD) data, the proposed quad-based representation takes into account, in a unified manner, different issues such as compactness, compression, and intermediate view synthesis. The representation is extracted from MVD data in two steps. First, a set of 3D quads is extract...

متن کامل

Silhouette-Edge-Based Descriptor for Human Action Representation and Recognition

Extraction and representation of postures and/or gestures from human activities in videos have been a focus of research in this area of action recognition. With various applications cropping up from different fields, this paper seeks to improve the performance of these action recognition machines by proposing a shape-based silhouette-edge descriptor for the human body. Information entropy, a me...

متن کامل

Multi-Scale Locality-Constrained Spatiotemporal Coding for Local Feature Based Human Action Recognition

We propose a Multiscale Locality-Constrained Spatiotemporal Coding (MLSC) method to improve the traditional bag of features (BoF) algorithm which ignores the spatiotemporal relationship of local features for human action recognition in video. To model this spatiotemporal relationship, MLSC involves the spatiotemporal position of local feature into feature coding processing. It projects local fe...

متن کامل

Human Action Recognition Based on Global Gist Feature and Local Patch Coding

Human action recognition has been a widely studied topic in the field of computer. However challenging problems exist for both local and global methods to classify human actions. Local methods usually ignore the structure information among local descriptors. Global methods generally have difficulties in occlusion and background clutter. To solve these problems, a novel combination representatio...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3283458